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Abstract: Searching for evidence of neutrino oscillations is an important prob- 
lem in particle physics. Suppose that evidence for neutrino oscillations from 
an LSND experiment reports a significant positive oscillation probability, but 
that the LSND result is not confirmed by other experiments. In statistics, such 
a problem can be proposed as the detection of signal events in the Poisson sig- 
nal plus background model. Suppose that an observed count X is of the form 
X = B + S, where the background B and the signal S are independent Poisson 
random variables with parameters b and 8 respectively, b is known but 8 is not. 
Some recent articles have suggested conditioning on the observed bound for 
B\ that is, if X = n is observed, the suggestion is to base the inference on 
the conditional distribution of X given B < n. This suggestion is used here 
to derive an estimator of the probability of the existence of the signal event. 
The estimator is examined from the view of decision theory and is shown to 
be admissible. 



1. Introduction 

In some problems, a signal S may be combined with a background B, leaving an 
observed count X = B + S. Here we suppose that B and S are independent Pois- 
son random variables with means b and 9 respectively so that X has a Poisson 
distribution with mean b + 9. Further b is assumed known but 9 is not, as might 
be appropriate if there were historical data on the background only. Models of this 
nature arise in astronomy and high energy physics. One of the most interesting 
problems is the issue in experiments of neutrino oscillations. Three types of neutri- 
nos have been identified. They are denoted by z/ e , and v T respectively. A task of 
the experiments is a search for za, oscillating into v e . There are 3 sources of back- 
ground. One is from the real v e and the other two are from two different mistaken 
identifications [H, HH. For example, the KARMEN Group has been searching for 
neutrino oscillations reported from an earlier experiment at Rutherford Laboratory 
Detector in the United Kingdom. They had expected to see about 15.8 background 
events and had observed 15 events total Q. This example and others have sparked 
interest in statistical inference when maximum likelihood estimators are on or near a 
physical boundary of the parameter space. Recent work along these lines is reviewed 
by Mandelkern [9( and discussants. Here, we consider the signal existence problem 
in an experiment. It could be either an estimation problem of the probability of 
the existence of the signal events or a testing problem of the existence of the signal 
events. The formulation for the estimation problem, estimating Pg[S > 0\X = n] 
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instead of 9 itself, is equivalent to estimating the parameter p — pe(n) — b n /(b+9) n 
given observed X = n since P g [S = 0\X = n] = b n /(b + 9) n . 

The quantity 1 — p is the conditional oscillation probability in experiments in 
searching the evidence of neutrino oscillations in particle physics. In the past few 
years, tremendous progress has been achieved to firmly establish the nature of 
neutrino oscillations using neutrinos from the sun [6j. However, this situation is 
still unsettled in accelerator based experiments. The only evidence for oscillations in 
the appearance mode is from the LSND experiment [l| , which reports an oscillation 
probability of 0.264% plus or minus measurement errors. However, the result from 
the LSND experiment is not confirmed from the KARMEN group [3, 0]j which 
reports no evidence of neutrino oscillations. The result from the LSND experiment is 
also partially excluded from accelerator based experiments, such as the NuTev group 
Q. Recently, the Mini-Boone experiment has started receiving data at Fermilab 
and will either confirm or refute the LSND effect, but the results won't be available 
until in a few more years [13]. If the neutrino oscillation results were confirmed, 
experiments would provide a precision measurement of the oscillation parameters 



Suppose that X = n is observed and consider the problem of estimating p = 
b n /(b + 9) n for 9 > 0. Let x V y = ma,x(x,y) and x A y = mm(x,y) for any x 
and y. Substituting the maximum likelihood estimator (MLE) 9 — max(0, n — b) = 
OV(n-b) into the expression, the MLE of p is given by pg = p§(n) = b n /(bV n) n . 
That is the MLE takes the ratio of the n-th power of the background parameter 
and the n-th power of the maximum of the background parameter and the observed 
counts. An obvious disadvantage of pg is that it is very spiky at b. For example, 
Pg — 1 if n < b and Pg w e - *-"" 6 -' if n is slightly larger than b for large n. This 
indicates that only a few more observations will make the MLE significantly change 
from 1 to a number close to no matter how large b is. Note that the background B is 
ancillary since its distribution is completely known. Even though it is not observed, 
its bound is. If X = n, then B < n. In this spirit, let p(n) — P(B — n\B < n), and 
observe that p is computable since b is known. Thus, p is the conditional probability 
of the number of background events equal to the observed counts, given the observed 
bound n < b; and < p < 1 for all n > 1. For the data reported by the KARMEN 
group, b = 15.8, X = 15. Since P[B = 15\B < 15] = 0.206 and 9 = 0, p = 0.206 
and ps = 1. 

The admissibility or the inadmissibility of the two estimators p and Pg will be 
discussed in Section 2. It is shown that p is an admissible estimator of p under the 
squared error loss and pg is an inadmissible estimator. It is also shown that p is 
generalized Bayes with respect to the uniform prior over [0, oo). The problems of 
the confidence intervals for p will be discussed in Section 3. The testing problem for 
the existence of the signal events will be discussed in Section 4. Section 4 will also 
discuss a modification of the Type I error rate for future studies on the problem of 
the existence of signal events. 



2. (In)admissibility 

It is convenient to let / M and be the probability mass function (PMF) and the 
cumulative distribution function (CDF) of the Poisson distribution with mean /i. 
Then 

(1) p{n) - P(B - n\B < n) - AM. 

F b (n) 
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It is also convenient to abbreviate p{X) and p§{X) by p and p@ respectively. Let p 
be any estimator of p. Then under the squared error loss, the risk of p is 



(2) 



R(p 7 9)=E8l(p-pf] = J2^ 



b n 



71=0 



(b + 6) n - 



2 fb+e(n). 



We say p is inadmissible if there is an estimator p' for which R(p' ,p) < R(p 1 p) for 
all 9 > with strict inequality for some 9. Otherwise, we say p is admissible. 

Inadmissibility of the MLE. If n is a er-finite measure over [0, oo), write E v for 
integration with respect to the joint distribution of 9 and X when 9 ~ 7r; and 
write -E"'( > |n) for conditional expectation given X — n. Further, let R(p,n) — 
J °° R(p, 9)ir(d9) = E^Kp—p) 2 ] for an estimator p of p; and let RM = -R(?5, 7r). 
The inadmissibility of the MLE p^ will be obtained from Stein [ll| . 

Theorem 1. If b > 1, then the MLE pg — b n /(bV n) n is inadmissible. 



Proof. From Stein ll| , a necessary and sufficient condition for pg to be admissible 
is that, for every 9q 6 [0, oo) and e > 0, there is a hnite prior tt for which tt{9o} > 1 
and R(pg,Tr) — R(tt) < e. In particular, if pg is to be admissible, then there must 
be a sequence ~K k of hnite priors for which 7rfc{l} > 1 for very k = 1, 2, . . . and 

(3) lim R{ P§ ,n k ) - Rfa) = 0. 

k — >oc 

Suppose © is true. Let p k (n) = E* k [b n /(b + 9) n \n}. Then 



oo „ c 

R(P§^k) ~ ^(TTfe) = y2lP§(n) -Pk(n)} 2 I 



(6 + 6*)™ 



(b+e) 



■;?.! 



7r fc (dfl) 



n=0 



So, lim fc _ 00 p fe (n) = p e -(n) = b n /(b V n) n for all n = 0, 1,2, . 



Note that 



Pk(n) 



Let 7r*(d6>) 



f o °° e-«7r fe (de) 



f b+e (n)n k (d9) 
fb+e(n)n k (d9) 
Then, we have 



6" 



f b+ g(n)ir k (d9) 



(b + 9) 
f b (n)e- e 7r k (d9). 



J °° f b (n)e- s 7r k (d0) .. / °° fe"7r£(dfl) 6« 
lim p k (n) = hm — ^ = hm „~ = 

fc-~ fc-oo f °° f b+0 (n)w k (d9) h->ooj™(b + e)»*i(d9) (bV' 



Let /Ltb,fe,„ = J °° (b + 9) n nl(d9). Note that ir k is a probability measure. Then, the 
above expression implies 



lim Hb,k,n = {by n) r 



for all n = 0,1,2, 
(4) 



This requires 



lim /J,h,k,n = 

k — »oo 



b n , 



if n < to 
if n > to 
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where m — [&J is the greatest integer not greater than b. Suppose a probability 
distribution G has moments ji n — b n or ji n — n n with respect to n < m or n > m 
respectively. Then G is determined by its moments since X^^Lo r! ™ r ™/ r i! converges if 
r < e _1 . Thus (j4|) requires that the distribution function Gk(z) = irl{9 ■ b + 8 < z] 
converge weakly to the distribution function G (see Section 30). For b > 1, 
it requires pi = 6. Then G must be degenerate at b so all /i„ = b for n > 1. 
Therefore, such a G can not exist. So, the assumed existence of irk in © leads to 
a contradiction for 6 > 1. □ 

Admissibility of p. The admissibility of p will be deduced from a Bayesian ap- 
proximation. Consider the priors ir a (d9) = e~°" e for 9 > and a > 0. Then, 7r Q 
is finite if a > and 7To is the infinite uniform distribution over [0, oo). Write E a 
for integration with respect to 7r a , E a (-\n) for posterior expectation, and R(p, a) = 
E a [(p — p) 2 ] = J °° R(p, 9)e~ a6 d9 for the integrated risk of an estimator p. This is 
minimized by p a = E a (jp\n). 

The computation of p a is straightforward. First the joint distribution of (X, 9) 
under 7r Q is 

= (b+er M+a)6jn p _ h ^ bH— 

k\{n-k)\ 



P a [X = n, d9] = K " ' e- b - [ - 1+a)ti d9 = e~" V , " " , Sl e- {1+a)0 d9. 

L J n, ^ kUn — fcl 



fe=0 



The marginal PMF of X under 7r Q is 

b k 



P a [X = n] =e~ b J2 



fc=0 



k\{n-k)\ J 



y 

^ mi 



k=0 



e n-k e -(l+ a )0 de 

(1 + a )n-k+i ~ (i + a )«+i F ( 1+ «) fc(n) - 



The posterior density of # under ir a is 

„q6 



(1 + a) n + 



J F {l+a)b( n ) 



Then 



Pa (n) = E a (p\n) 



(5) 



(1 + a) 



^^(l+a)i(») 



(b + g)" e _ 6 - (1+a)9(fgj 



(6 + g) w ._ 6 _ fl+aV 



(b + 8Y 



d6 



(A 6 



b" 



f(i+a)b(n) 



n!(l + a) % a )b(n) 

From the expression of p in (TT|), one has po = P- Let <5 aj „ = f(i+ a )b{ n ) I F(i+a)b( 7 
and denote 5o,n by S n . Then, p a = 5 a>n and p = S n . 

Theorem 2. //p Q and p are as in ([5]) and ((TJ), iften lim a ^o E a {p a — p) 2 = 0. 
Proof. Consider 

/•OO 

^"(Pa -P) 2 = / E e (p a -p)e- a6 d9 
Jo 

Jo ±^ « ! 



< 



<y 5 
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Since 

\S a , n - S n \ 2 < Sl n + Si < 5 a , n + S n < e^ b f {1+a)b (n) + e b f b (n) 
and the right side is summable over n > 0, 

oo oo 

lim E a (p a -p) 2 < lim y> a ,n - tnf = Y\ Km (S a , n - S n ) 2 = 0. 

a—>0 a^O z — J £ — * a— >0 

n=0 n=0 

from the Dominated Convergence Theorem. □ 

Since p a , p are less than 1 and for any estimator p with < p < 1 , (p a —p) (p a —p) 
is absolutely bounded by 4. By the Dominated Convergence Theorem for the second 
equality below, we have 

E a {(p a -p)(p a -p)} 

- rf>M -P(n)]\p a (n) - ^—]^±pie-^e-^d9 
J a ±< (b + 9) n n\ 



n=0 



b" 



{b+oy {b+B) g 



(b + 6) n> n\ 



r-OO 

= ^2\Pa{n)-p(n)] [p a (n) 

n=0 J ° 

Then for a > 0, R(p,a) = R{p a , a) + E a [{p - p a ) 2 ]. 
Corollary 1. lim Q ^o [R{p, a) — R(p a ,a)] = 0. 

Proof. For a > 0, R(p, a) = E a [(p a - p) 2 + ( Pa - p) 2 } = R{p, a) + E a [(p a - p) 2 ] 
and E a [(p — p a ) 2 } — > as a — > by the theorem. □ 

Since < p < 1 for any estimator of p (under consideration), it is clear that 
from the Dominated Convergence Theorem that R{p, 6) is continuous in for any 
estimator p of p. 

Corollary 2. p is admissible. 

Proof. If p were inadmissible, then there would be a p for which R(p, 9) < R(p,8) for 
all e > and ) < R{p, 6 ) for some 6 > 0. Let e = 6> ) - R(p, )]/2. 
Then, there exist an r] > 0, such that R(p, 0) > R(p, 9) + eo for all non-negative 9 
such that \6 — O \ < r). Then 

R(p, a) — R(p a , ct) > R(p, a) — R(P, ol) 

rdo + n 

> / [R{p,9) - R(p,9)}e- a6 d9 
Je 

e ~u0 o _ e ~a(9 +ri) 

>e > e ri > 

a 

as a — > 0, contradicting Corollary 1. □ 



Of course, it does not follow that the risk of p dominates that of px. The mean 
squared errors of the two estimators are shown in Figure [1] The graphs show that 
the MSEs for p are greater than the MESs for pg for small 9 but less than the MSEs 
for pg for moderate and large 9. 
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Fig 1. The MSE for p (Solid) and pg (dotted) for selected b. 



3. Confidence and credible interval 



Three confidence or credible intervals are considered in this section. They are the 
unified confidence interval proposed by Feldman and Cousins [8| , the conditional frc- 
quentist confidence interval proposed by Roe and Woodroofe [l0( , and the Bayesian 
credible interval. 

The unified confidence interval for p is obtained from the unified approach. It 
is a transformed interval from the unified confidence interval for 9. In the unified 
approach, one considers the likelihood ratio 

fb+e(k) 



R(9\k) = 



The unified confidence interval for 9 is consisted of taking those 9 for which R(9\k) > 
c(9) , where c{9) is the largest value of c for which 



E fb+e(k) > 1 

k:R(6\k)>c 



a. 



The conditional frequentist confidence interval for p is also a transformed interval 
from the conditional frequentist interval for 9. In this approach, one modifies the 
Poisson PMF by the conditional distribution on B < n as 



qUk) 



f b+e (k)/F b (n), 
EUhti)fe(k-3)/F b (n), 

and then considers the modified likelihood ratio as 

QUk) 



if k < n 
if k > n, 



R n (9\k) 



maxf?/ qj^ 9 ,(k) ' 
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Let c n (9) be the largest value of c for which 

qle(k)>l-a. 

k:R n (e\k)>c 

The conditional frequentist confidence interval consists of those 9 for which 
R n {9\k)>c n (9). 

Let [£ u (n), u u (n)] and [£ c (n), u c (n)] be the unified confidence interval and the 
conditional frequentist interval for 9 respectively. Since both of them are transfor- 
mation invariant by their definitions, the unified confidence interval [£ u (n), u u (n)] 
and the conditional frequentist confidence interval [£ c {n), u c {n)\ for p are 

b n b n 

[£ u (n),u u (n)] 



L (b + u u (n)y^ (b + £ u (n)r' 
and 

b n b n 
\£ c (n),u c (n)] = \— , . , = 1 

respectively. 

Recall that the point estimation p of p is also the Bayesian estimator of p under 
the uniform prior. The Bayesian credible interval for p is considered under the 
uniform prior for 9 with the least interval length. The method to compute such an 
interval is introduced in Berger's book ( [3] , P- 266). 

Straightforwardly, the posterior density of 9 under the uniform prior for 9 over 
[0, oo ) is 



g(9\n) 



fb+e(n) _ _J_(b + 9T_ {b+e) 



J™f b+e (n)d9 F b (n) n 



Substituting 9 = b(p » — 1) above and multiplying the absolute value of d9/dp = 
—bp ~ jn on the right side of the above equation, one obtains the posterior density 
function of p, < p < 1, given X — n as 



,d6, b n+1 2„+i 

-p n e 



g{p\n) = g(b(p- 1/n " 1)N?I = — ^ 



dp n\nFf,(n) 

Note that g(p\n) is monotone increasing in p if b > 2n + 1 and is unimodal in p 
with mode (j^rx)™ if & < 2n + 1. Let p max = (5^1)" A 1- Then g(p\n) is uniquely 
maximized at p max , increasing in p if p < p max and decreasing in p if p > p max for 
< p < 1. The 1 — a level Bayesian credible interval [£, u] — [£(n), u(n)] for p can 
be uniquely solved from 

(6) G(u\n) - G(l\n) = 1 - a 

and 



(7) [£,u} = {p:g(p\n)>c n } 

for some c„ > 0, where G(-\n) is the posterior distribution of p given X = n under 
the uniform prior for 9. That the interval solved from ([6]) and ||7J) is the shortest 
interval among those satisfying condition ((6| is based on Berger's book. 

The algorithm to compute the 1 — a level Bayesian credible interval [£, u] for p 
can be described as follows: 
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Table 1 

90% confidence and credible intervals for p when n is close to b for selected b 



b 


n 


Unified 


Conditional Frequontist 


B ayes i an 


1 







n 11 






I 


\ 


1 Zi. j£X I JLU J f J. J 




11 
■ 1 l 


Ci R9T 1 n -1 n s^9l 


I 


j 


9 8 ,: Ulf) — 2> ) 1 

lii.OvJV J-W l 5 -LI 




11 




2 


1 


[3. 89(10-!), 1] 


[4.42(10 _1 ) 


.1] 


[4.00(10" 1 ), 1] 


2 


2 


[1.08(10 _1 ), 1] 


[1.15C10- 1 ) 


,1] 


[4.49(10-2), 0.801] 


2 


3 


[2.02(10~ 2 ), 1] 


[1.95(10~ 2 ) 


.1] 


[2.23(10" 3 ), 0.555] 


2 


1 


[2.84(10 -3 ), 0.952] 


[2.93(10~ 3 ) 


,1] 


[8.21(10~ 5 ), 0.273] 


5 


4 


[7.20(10- 2 ), 1] 


[1.14(10 _1 ) 


-1] 


[1.53(10-2), 0.823] 


5 


5 


[2.24(10- 2 ), 1] 


[3.15(10- 2 ) 


,1] 


[1.06(10- 3 ), 0.712] 


5 


6 


[5.67(10" 3 ), 1] 


[6.86(10" 3 ) 


■1] 


[5.85(10" 5 ), 0.564] 


5 


7 


[1.37(10- 3 ), 1] 


[1.61(10- 3 ) 


.1] 


[2.78(10" 6 ), 0.384] 


10 


9 


[1.17(10- 2 ), 1] 


[2.18(10-2) 


.1] 


[3.33(10- 5 ), 0.719] 


10 


10 


[4.02(10" 3 ), 1] 


[6.68(10- 3 ) 


.1] 


[2.10(10- 6 ), 0.633] 


10 


11 


[1.53(10" 3 ), 1] 


[1.75(10- 3 ) 


.1] 


[1.20(H)- 7 ), 0.528] 


10 


12 


[4.07(10~ 4 ), 1] 


[4.51(10- 4 ) 


,1] 


[6.29(10" 9 ), 0.406] 


10 


1.3 


[9.71(10" 5 ), 0.834] 


[1.18(10" 4 ) 


■1] 


[3.15(10- 10 ),0.278] 


10 


14 


[2.09(10~ 5 ), 0.501] 


[2.22(10" 5 ) 


,1] 


[1.54(10- n ), 0.163] 



(i) Find a z so that G{z\n) = a. 

(ii) If g(z\n) < g(l|n), then ^ = z and u = 1. Otherwise, repeat step (iii) until 
convergence. 

(iii) Let c% = g(z\n) and c 2 = g(l\n). Let c = (ci + c 2 )/2. Findp^ < p max < p u so 
that g(p<>) = g{p u ) = Co- If G{p u ) — G(pe) > 1 — a then let c 2 = Co; otherwise 
let Ci = Co and iterate. 

Table [T] lists the lower bound and upper bound of the confidence and credible 
intervals for p as functions of n when n is close to the background parameter b for 
selected b. 

The frequentist coverage probability of the confidence and credible intervals for 
p is the probability of the interval to contain the true value of pe(n) = b n / (b + 9) n . 
The numerical results of the frequentist coverage probability of the 90% confidence 
and credible intervals are given in Figure [21 As suggested by Figure [21 the Bayesian 
credible interval has a low coverage when 9 is close to and it steadily increases to 
almost 0.9 as 9 becomes large. However, since the length of the Bayesian credible 
interval is much shorter than the conditional frequentist and the unified intervals 
from the numerical result, Figure [2] does not suggest that the Bayesian credible 
interval is worse than the other two. The curves of coverage probabilities fluctuate 
up and down since the coverage probabilities are discontinuous in 9. 

4. A testing problem 

An interesting problem in particle physics is to determine if there exist any signal 
events in an experiment. In statistics, this problem can be described as a testing 
problem for the null hypothesis of Hq : S = versus the alternative hypothesis of 
H x : S > 0. Recall that p = Pg[S = 0\X = n] = b n /(b + 9) n and note that 9 is 
unknown. The probability p should be estimated. As we have seen, two estimators 
have been discussed. The first one is p and the second one is the MLE pg. Since Pg 
is inadmissible, we only discuss p(n) = fb(ri) / F&(n) here. 

An estimation problem here is to estimate the indicator function Is>o instead of 
its probability. As an estimator of /s>o, we will show that 1 — p is also admissible 
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8 10 




Conditional Frequentist 

Unified 

Bayesian 



8 10 



Conditional Frequentist 
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Bayesian 



Conditional Frequentist 

Unified 



FlG 2. Frequentist coverage probability of the 90% confidence and credible intervals for p as 
functions of 6 for selected b. 



under the squared error loss, where the squared error loss is defined by L(<5, S) — 
(6 — Is>o) 2 for an estimator S of Is>o- 

As before, we still consider the priors 7T a (9) — e~ a9 , 6 > for a > 0, but here we 
define ii(($,0) = E 9 [(6-I s >o) 2 ] and R(S, a) = E a [{5 - I s>0 ) 2 ] = J °° R{8,6)e-« e d6. 
This is minimized by E a (Is>o\ n )- 

The computation of E a (Is>o\n) is still straightforward. Since P$[B — k,X = 
n] = b k d n ~ k e~( b+d ' /[k\(n — k)\] follows the joint posterior mass function, for n > 0, 

b h e~ b 



P a \B = k,X = n]= / P e [B =k,X = n} e - a0 d6 = — - 
Jo kl(l 

and the marginal posterior mass function of X, for n > 0, 

« ab 

P a [X = n]= YJ~[B = k,X = n}= {i + a)n+1 F {1+a)b (n). 

Then, the conditional posterior mass function of B = k given X = n is 

(l + a) k e-^ b _ f {1+a)b {k) 



Yn-fc+1 ' 



P a \B = k\X = n} = 



This gives 



E a (I s>0 \n) = Y,P a lB = k\X = n} = 



fe=0 



klF (1+a)b (n) F {1+a)b (n)' 

F(i+a)b(n - 1) 
F(i +a )b(n) 



Since E a (Is- > a\n) = 1 — p a {n) where p a (n) is given by ([5]) and since for a > there 
is also the identity R(p, a) = R(p,a) + E a [(p a — p) 2 }, by Theorem [2] one can see 
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that 1 — p is also an admissible estimator of the indicator function Is>o- Details of 
the proof is omitted here since there are many overlaps in the proof. 

The mean squared errors of the two estimators p and pg as estimators of Is>o 
are identical to the mean squared errors of the two estimators as estimators of 1— p, 
which can also be seen in Figure [T] However as estimators of Is>o, one can also 
study the conditional risk, where the conditional risk of an estimator 6 of Is>a is 
defined by Eg[(5 — Is>o) 2 \B <n]. As a result, the conditional risk of 1 — p is 

(8) E e [(p - I s=0 f\B <n] = V(I B=n \B <n)= l) 

and the conditional risk of the MLE 1 — pg (n) is 

E e [(p s ~ I s =o) 2 \B < n] =V{I B =n\B < 7i) + \p § (n) - p(n)} 2 

(9) f b (n)F b {n-l) , J b (n) b n l2 



F£{n) F b {n) (bVn) n> 

Obviously they do not depend on 9 and ([5]) is less than ©. The conditional risks of 
1 — p and the MLE 1 — pp as functions of n for selected b are displayed in Figure [3] 
Type I error rate can be modified in the testing problem of the null hypothesis 
of H : S = versus the alternative hypothesis of Hi : S > 0. Suppose the null 
hypothesis is rejected if n > no, where no is a fixed integer. Then, the classical type 
I error rate is 

a(no) = Pe[X > n \S = 0] = P b [X > n ] = 1 - F b (n - 1). 
Based on the estimator p of p, a modified type I error rate is proposed as follows. 




b=5 b=10 




02468 5 10 15 



Fig 3. Conditional risk of estimators of Is>o as functions of n for selected b. 
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Table 2 

Least no for a(ng) < 0.1 and a* (no) < 0.1 respectively 



b 


Classical 


Modified 


1 


3 


3 


2 


5 


5 




6 


6 


4 


8 


7 


5 


9 


8 


6 


10 


9 


7 


11 


10 


8 


13 


11 


9 


11 


12 


10 


15 


13 



Note that 

Pe[X>n \S-0}- pe[s ^ Q] = EZoP9[x = n , s = 0] 

_ En=n o PeiS = 0\X = n]P e [X = n} E~ n „ p 9 M/ 6+9 (n) 
E^o W = 0|X = n]Pg[X = n] E: =0 w(«)/h 9 («) ' 

Substituting pg{n) by p(n) in both the numerator and denominator in the last 
equation, we have 



a* g (n ) 



E,?=o P( n )fo+d{n) 



Since a* e (na) is strictly decreasing in 9 for any hq > 0, a modified type I error rate 
is proposed by 



a*(n ) = aJOo) 



E^= p( n )/fc( n ) ' 



Table [2] lists the least no for a (no) < 0.1 and a*(no) < 0.1 respectively for selected 
b. It can be seen that the values of no based on modified type I error rates are 
always no greater than those based on the classical type I error rates. 



5. Remarks 



An intrinsic testing problem in particle physics is to know the general conclusion 
of the possibility of the signal events in experiments, which can be described as 
a testing problem for the null hypothesis of Hq : 6 = versus the alternative 
hypothesis of Hi : 9 > 0. Since Pe[X = n\S = 0] = Pq[X = n], the two testing 
problems are equivalent if the inference is only based on observations of a single 
experiment. 

Conditioning on B < X when X — n in the signal plus background model 
was first proposed by Roe and Woodroofe [13]. Later Woodroofe and Wang [3] 
consider a testing problem for Hq : 9 > 9q versus Hi : 9 < 9q for a positive #o 
based on the condition B < n. Later on, Zhang and Woodroofe 14[ consider an 
estimation problem for the signal event still based on the condition B < n. This 
paper considers the estimation of the probability of the existence of signal events 
based on the condition B < n. 



Signal plus background model 
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